How to Deal with Small Open Reading Frames?

نویسندگان

  • Malgorzata Wanczyk
  • Pawel Blazej
  • Pawel Mackiewicz
  • Stanislaw Cebrat
چکیده

Current ’classical’ algorithms recognizing protein coding sequences do not work effectively with sequences of small length. To deal with this problem we have proposed some improvements of the existing gene finders without any assumed arbitrary threshold. Introduced parameters describe position of tested sequences in the ranking of all small Open Reading Frames and short protein coding genes found in the analyzed genome. The sequences can be ranked according to the coding potential calculated by ’standard’ gene prediction algorithms. As an example, we used two algorithms for gene recognition and tested the set of selected small ORFs which were selected from prokaryotic genomes using sequence similarity methods. The applied approach enabled to identify promising sequence that can code for small proteins.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

sORF finder: a program package to identify small open reading frames (sORFs) with high coding potential

Summary: sORF finder is a program package for identifying small open reading frames (sORFs) with high coding potential. This application allows identification of coding sORFs according to the nucleotide composition bias among coding sequences and the potential functional constraint at the amino acid level through evaluation of synonymous and nonsynonymous substitution rates. Availability: Onlin...

متن کامل

sORF finder: a program package to identify small open reading frames with high coding potential

SUMMARY sORF finder is a program package for identifying small open reading frames (sORFs) with high-coding potential. This application allows the identification of coding sORFs according to the nucleotide composition bias among coding sequences and the potential functional constraint at the amino acid level through evaluation of synonymous and non-synonymous substitution rates. AVAILABILITY ...

متن کامل

Cloning and sequencing of a putative Escherichia coli [NiFe] hydrogenase-1 operon containing six open reading frames.

DNA encompassing the structural genes of an Escherichia coli [NiFe] hydrogenase has been cloned and sequenced. The genes were identified as those encoding the large and small subunits of hydrogenase isozyme 1 based on NH2-terminal sequences of purified subunits (kindly provided by K. Francis and K. T. Shanmugam). The structural genes formed part of a putative operon that contained four addition...

متن کامل

Mini-exon epitope tagging for analysis of the protein coding potential of genomic sequence.

A novel approach to gene discovery and analysis is described. A small exon flanked by consensus 3' and 5' splice sites was synthesized. The exon contains open reading frames encoding 43 amino acid peptides. There are no stop codons in any of the three reading frames, and each reading frame contains an epitope recognized by the same monoclonal antibody. The exon can be inserted into the introns ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012